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Abstract 

We study the unexpected high flexibility of short dsDNA which recently has been reported by 
a number of experiments. Via the Langevin dynamics simulation of our Breathing DNA model, 
first we observe the formation of bubbles within the duplex and also forks at the ends, with the 
size distributions independent of the contour length. We find that these local denaturations at 
a physiological temperature, despite their rare and transient presence, can lower the persistence 
length drastically for a short DNA segment in agreement with experiment. 
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The DNA is a double helix of two single-stranded (ss) backbone chains paired by compli- 
mentary bases via hydrogen-bonding and further stabilized by stacking interaction between 
adjacent base-pair (bp) planes [lj]. Owing to recent advances of single molecule techniques, 
intensive studies have been done on structural changes and mechanical behaviors of double- 
stranded (ds) DNA constrained by external forces and twists. The continuum elastic model, 
called wormlike chain (WLC) model has been very useful in analytically describing the 
micron-scale conformations and elastic response of such DNA The persistence length, 
the measure of segmental orientation correlation, is about 50 nm (equivalently, about 150 
bases along contour) for dsDNA. On the contrary numerous biological facts suggests DNA 
loops more readily on much shorter length scales. Indeed, Cloutier and Widom showed that 
the DNA has much higher cyclization probability than predicted by a WLC of persistence 
length of 50 nm Also, Wiggins el al. showed the DNA on short length scale has an elastic 
behavior distinct from that of WLC while Yuan et al. very recently have reported the 
persistence lengths are as short as 11 nm for DNA fragments consisting of 10 ~ 20 base-pair 

(bp) Q. 

In this paper we demonstrate that the higher flexibility of dsDNA emerges on shorter 
scales indeed due to local denaturation. Because of the large initiation energy, the fraction of 
the open bases is less than 1% at the physiological temperature, and, once formed, bubbles 
decay shortly in the order of 50 /is 6|, seemingly little affecting the DNA stability. We 
show that despite their rare and transient presence the bubbles give a drastic enhancement 
of the flexibility as the chain gets shorter. For a very short duplex fragment, another type 
of local denaturation, namely, forks at the free ends are entropically favorable, dominating 
over the bubbles to enhance the flexibility, and eventually driving the duplex to unbind into 
two single strands. 

It was suggested and estnnated that the basenips fl and Udnks Q can enhance the 
bending flexibility and looping probability. Based on a simple two state model, the looping 
probability was evaluated using transfer matrix method [3] • Although all these calculations 
are suggestive of their significance, the bubbles have not yet been explicitly accounted for 
with regard to their size distributions and realistic energetics. 

We consider a homogeneous dsDNA as the duplex of two interacting single strands de- 
scribed by the effective energy 

n = n 1 + n 2 + v 12 . (i) 
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The Hi is the elastic energy of the single strand i (=1 or 2), which, in a discrete represen- 
tation, is given by 

N-l N-l 



% = £ I A - 2r« + p& ,) 2 + £ f (|r« 1 - r»| - b)\ (2) 

n=2 n=l 

where r„ is the three-dimensional position vector of nth bead in strand % {n = 1, 2, • • •, N) 
and b (= 0.34 nm) is an average distance between neighboring beads within the ss. The 
first term accounts for bending energy with the bending modulus nb 3 for the ss, which 
is the related to its persistence length (L p ) via Kb 3 = LpksT Q. The second term, the 
stretching energy of each strand, is introduced to impose chain inextensibility condition. 
An appropriate value of k is numerically found by matching simulation of either force- 
extension curves or the mean end-to-end distances of ssDNA resulting from this model, with 
corresponding theoretical expression of the inextensible WLC without the stretching term. 
The V12 is the pairing energy between complimentary bases. To describe the bp openings 
due to thermal excitation, namely, thermal bubbles and forks, the interaction is represented 
by V12 = X) n =i Hi (V n ) where 

V n (r n ) = De- {rn - ro)/a [e- {r "- ro)/a - 2] (3) 

is the Morse potential jsl, Q, r n = — is the distance between nth bp, r and a are 
the bond distance and range which correspond to mean and fluctuation of DNA diameter 
respectively, and D is the potential depth. Whenever bps are unbound, the duplex becomes 
no more than two single strands with net persistence length as short as twice of L ss = 1 ~ 4 



nm llj . while for bound bps it takes L^s = 50 nm, the persistence length of long DNA, 
due to stacking interaction in the double strand. In order to include the stacking and 
destacking that cause such variation of the persistence length in the model, we consider 
K(n) = L p (n)kBT/2b 3 , and propose the persistence length per single strand L p (n) takes the 
form 

L p (n) = L ds /2 - (L ds /2 - L ss )6i(r n _i - r 1/2 )9{r n - r 1/2 )9(r n+1 - r 1/2 ). (4) 

Here 9{r) = [1 — erf(r/c)]/2 is a steplike form function which smoothly increases from to 
1 over the width c and ri/ 2 is the distance at which L p is (L ds /2 + L ss )/2 (see Fig. [[]). 

L, depends on three consecutive bp distances natoraily as it represents the cooperatrvity 
of the stacking interaction [121 ] . The variation of L p depending on the bp distance r is 
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FIG. 1: A schematic figure showing the profile of persistence length L p per single strand (dashed 
curve), along with the Morse potential V(r) (bold curve) as function of bp distance r. 

schematically illustrated in Fig. [1] When all of the three consecutive bps are outside the 
range ry2, namely they are unbound one other, L p takes the single-stranded persistence 
length L ss while as any one of the three are bound, L p is reduced to L^jl. The c is 
comparable to the Morse potential width a since the L p varies due to the bp unbinding. 

Using this energy model, we simulate the dynamics and equilibrium distribution of the 
bp distance, via the Langevin equation, 



dt 



.in 



(t) 



where frictional coefficient per base is V 
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1.88 x KT 11 kg s" 1 with rj =0.001 



kgm~ s for viscosity of water and R = 1 nm for effective radius of nucleotide. £ n is the 
Gaussian and white noise satisfying (£n2) = 0, (£,nl(t)^ a ,(t')) = 2Tk bT 'Sij8 nn > 8 aa > 5(t — t') for 
each Cartesian component a and a'. The values of potential depth and range, D = 0.07 eV, 
a = 0.05 nm are chosen so that the 300-bp DNA fragments have the melting temperature 
350 K 3. 

In our simulation, we employed 50 nm for the persistence length of dsDNA an d 4 

nm for that of ssDNA (L ss ) with the stretching constant k = 41 pN/nm. For the parameters 
in L p (n), T\ii was chosen to (r + r c )/2 = 2.1 nm where r c is the cutoff distance at which 
a bp is regarded as unbound in the simulation, and c is 0.045 nm 8j. We checked that 
small variation of the parameters values does not affect significantly the main result of our 
simulation. 

Figure [2] shows a snapshot profile of equilibrated bp distance along the contour of 300 
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FIG. 2: (a) A snapshot of bp distance in a duplex of 300-bp (~ 100 nm) long obtained from 
simulations of the Breathing DNA model. The shape of duplex thickness therefrom drawn manifests 
breathing, formation of bubble in the middle of the duplex and forks at the ends, (b) The simulation 
data on fraction of bound bp in the 300-bp long duplex vs. temperature. It shows a sharp transition 
into denaturation state at melting temperature (~ 350 K). At physiological temperature (~310 
K), the fraction less than 1% is denatured (unbounded) either as bubbles or forks. 

bp long or about 100 nm, with free ends, and the fluctuating DNA thickness of about 2 nm 
therefrom constructed. The figure shows indeed the local denaturation exist in the forms of 
bubbles within the contour and fork at the ends. From the simulation we find the fraction 
of bound pairs undergoes a sharp transition from near unity to zero in agreement with 



experiment [13| at the melting temperature, which is about 350 K. It indicates that at the 
physiological temperature (310 K), the fraction of the bp that forms the local denaturation 
is less than 1%. 

Shown in the Fig. [3] are the size distributions of the bubbles (P&(n)) and forks (P/(n)) for 
the various contour lengths of the DNA fragments. The distribution of the bubble size with 
n bp open is remarkably independent of its position and the DNA length. For relatively 
large bubbles it follows the Poland-Sherega form 
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FIG. 3: (a) The bubble size distribution P&(n) at 310 K for contour length L = 55, 100, 150, and 
300 bp. (b) The fork size distribution Pf(n) at 310 K for the same lengths. 

e -nA 

P b (n) ~ — , (6) 

where a is the statistical factor for a loop formation and A is the average energy in k^T 
to unbind a bp. These factors are independent of the length and given by those for a long 
length, a = 1.7, A = 0.33, at 310 K jg]. The average size of the bubbles for various contour 
lengths is 1.7 bp at 310 K. This is in agreement with the Peyrard-Bishop-Dauxois model 
simulation 15[. The length-independence of their distribution and small average size implies 
that the thermal bubbles are transiently excited due to short-range elastic interaction in 
unconstrained DNA. This is to be contrasted with the dsDNA under mechanical constraints, 
where the average bubble size is found to be much larger depending on the (negative) twist 
as well as the contour length The distribution of n bp open at a free end is given by 

e -nA 

where = 1.1, A = 0.33 are also independent of the contour length within the errors. 
The average size of the forks is 1.9 bp. Since L ss is 4 nm in these simulations, they are 
semiflexible forks with (3 much larger than that of the flexible forks [l^] 

Last, we obtain the JV&(m), the relative probability of finding m bubbles simultaneously in 
a DNA of given length. The result obtained from simulation is presented in Fig. HJ showing 
that the single bubble occurrence becomes predominant as the DNA length gets shorter. 
This is mainly due to the large energy cost of bubble initiation; namely, once a bubble 
is formed, increasing its size is energetically more favorable than opening a new bubble 
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FIG. 4: The relative probability of finding m bubbles within three DNA lengths at 310 K. 
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FIG. 5: Effective persistence length of the DNA fragments of the length L taken from inside a 300 
bp long DNA (empty circles) This is compared with theoretical curve (line) obtained assuming a 
single bubble of the size given by the mean, 1.7 bp. 

elsewhere. Within the short DNAs of our interest, the number of bubbles, if they exist at 
all, can be regarded to be unity in a good approximation. 

To focus on the effects of bubble (excluding the end fork effect) on the overall duplex 
persistence length, we consider an ensemble of fragments with given contour length L ran- 
domly taken from inside a 300-bp long (~ 100 nm) dsDNA at a physiological temperature. 
The square of duplex end-to-end distance, Rg e , is denned as arithmetical average of the 
end-to-end distances of the two single strands. The thermal average of its square (R.g e ) is 
taken over the ensemble of the duplex with a given L. Suppose that the duplex to be a 
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WLC with a uniform persistence length Lfff, we use the well-known relation 



<R e 2 e > = 2LfL - 2{Lf f[l - e- L '#']. (8) 

Comparing this with simulation data, we evaluate Ly* of our duplex for various values of 
the contour length L. The result shown in Fig. [5] indicates that Lif* decreases from the 
value 50 nm to 40 nm as the L decreases to 10 nm. It reflects the enhanced flexibility of 
the duplex, which we attribute to the presence of thermal bubbles. Along the DNA length 
shorter than 50 nm, if any, a single bubble is most likely to exist, as shown in Nb(m), Fig. HI 
If the single bubble survives as the contour length decreases, its mean size does not change 
as discussed earlier, consequently yielding larger flexibility. 

To support this argument quantitatively, we note that the mean end-to-end distance 
is given by (Rg e ) = dsds'(t(s) ■ t(s')) 18], where t(s) is the unit tangent vector 

at the arclength s from an end of the duplex. The tangent correlation function is given by 
(t(s)-t(s')) = exp{ — \s' — s\/Lds] if they are the positions within ds region and (t(s)-t(s')} = 
exp{— \s' — s\/2L ss } if s and s' are the points within the bubble region. Integrating the 
correlation function along the contour with a bubble of the mean size, 1.7 bp, we evaluate 
the (Rg e ) . Since a bubble occurs with equal probability along the contour we further average 
the mean square end-to-end distance over all possible position of the bubble, and relate it 
with the effective persistence length, which is shown by the curve in Fig. [5j 

For the length larger than 10 nm, this analytical theory agrees with simulation result 
remarkably, suggesting that the persistence length reduction is indeed due to a single bubble. 
As the length decreases below that, this curve departs much from the simulation result for 
the length shorter than that. It is because even a single bubble is unlikely to occur within 
such a short length so that the persistence length sharply rises to the double strand value. 
For a long DNA fragment, on the other hand, only a bubble is found in most cases with its 
average size fixed independently of DNA length, which results in no significant perturbations 
on the persistence length. 

For the short duplex with free ends, we also have investigated the mean end-to-end dis- 
tances of DNA fragments length shorter than 50 nm (147 bp) at 310 K. The persistence 
length calculated in a similar way from these data is shown by triangles in Fig. |6j It clearly 
shows that L e J* , in this range of the contour length, are shorter than Ld s = 50 nm but 
converges to the value as the contour length increases. The persistence length in this case 
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FIG. 6: Effective persistence length of the fragment of DNA with contour lengths L, where bubbles 
as well as forks exist (triangle). It is in a good agreement with the WLC theoretical curve with 
a bubble of the size given as its average, 1.7 bp and two forks of the size given as its average, 1.9 
bp, and with the results of ref. |f| obtained by a fluorescence resonance energy transfer (FRET) 
experiment. 

is shorter than that of the duplex of the same contour length without the free ends dis- 
cussed before (indicated by circles). This can be ascribed to the additional form of local 
denaturation, i.e., the forks at the free ends. To support this quantitatively, we analytically 
calculated the effective persistence length of the duplex with a single bubble of the size 
1.7 bp and two forks of the size 1.9 bp at the ends by integrating the correlation function 
and following the procedure as before. The close agreement of the theoretical curve with 
the simulation evidences additional influence of the forks; the two forks combines with and 
increasingly dominates over a bubble, to enhance flexibility, as the duplex gets shorter. 



In ref. the effective persistence length of dsDNAs of contour length 15 ~ 21 bp 
(equivalently 5 ~ 7 nm) was determined to be 1 1 ± 2 nm by a fluorescence resonance energy 
transfer (FRET) experiment, which is marked by a cross in Fig. [6j They considered the 
buffer conditions where the persistence lengths of non-interacting single stranded DNA have 



the range of the values 2.7-3 nm 11|. In a duplex, however, the two unpaired single strands 



are subject to steric and electrostatic repulsions, which may enhance this ss persistence 
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length effectively to about 4 nm. Indeed, our values of L e Jf estimated from simulation with 
L ss (=4 nm) are in a good agreement with this experimental value. 

Simulating the Breathing DNA model that incorporates locally fluctuating persistence 
lengths depending upon the bp distances along the contour, we find that the distributions 
of bubbles and forks are nearly independent of the contour length, reducing the effective 
persistence length of the duplex for the short lengths. The forks dominate the bubbles in 
enhancing duplex flexibility and inducing unbinding transition. 
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